Method and Device for Generating a Predicted Value of an Image

ABSTRACT

A method and device for generating a predicted value of an image, which are used to generate a predicted value of a current block during image encoding or decoding. The method includes: determining a searching scope, wherein multiple motion vectors are included in the searching scope; performing up-sampling interpolations on first reference blocks, corresponding to the motion vectors in the searching scope, in a reference image of the current block by using a first filter to obtain up-sampled first reference blocks; obtaining at least one candidate motion vector corresponding to the current block by using the up-sampled first reference blocks; performing up-sampling interpolations on second reference blocks, corresponding to the at least one candidate motion vector, in the reference image of the current block by using a second filter to obtain up-sampled second reference blocks; and combining the up-sampled second reference blocks to obtain a predicted value of the current block.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of application Ser. No. 13/738,773, filed on Jan. 10, 2013, which is a continuation of International Application No. PCT/CN2011/075034, filed on May 31, 2011, which claims priority to Chinese Patent Application No. 201010227642.7, filed on Jul. 10, 2010. The afore-mentioned patent applications are hereby incorporated by reference in their entireties.

TECHNICAL FIELD

The present invention relates to image encoding/decoding techniques in the field of signal processing. More particularly, the present invention relates to a method and device for generating a predicted value of an image.

BACKGROUND

With the continuous development of video encoding techniques, inter-frame motion searching and compensating techniques can effectively remove the temporal redundancy of video content, greatly improving the efficiency of encoding; motion information is obtained through motion estimation and is then transmitted to a decoder, where a video image is predicted and reconstructed by means of motion compensation. The inter-frame motion compensation prediction technique performs motion compensation by using motion vectors obtained through search in a process of encoding and decoding, so as to realize the corresponding encoding and decoding operations.

As an encoding and decoding method, the technique of decoder-side derivable motion vectors has become more and more familiar for its great contribution to encoding efficiency; its value is widely recognized, and it has been accepted as an important candidate tool for future video encoding standards.

One method among the conventional derivable-motion-vector techniques for acquiring motion information comprises obtaining a motion vector through template matching. Specifically, assume that the current block is a macro block in frame B, as shown in FIG. 1. First, a template TM is constructed around the current block by using reconstruction values; then, motion vectors MV0 and MV1 are obtained as the motion vectors of the current block by performing search matching, using a template of the same shape as the template TM, in a forward reference image (Ref List0) and a backward reference image (Ref List1) of the block; a predicted value of the current block is obtained at the same time, and predicted residual decoding is performed on the predicted value.

In the implementation of the present invention, the inventor found that at least the following defect exists in the prior art: the encoder does not transmit the motion vector of a predicted value to the decoder, and the decoder needs to obtain, before restoring a current block, the motion vector of the current block by means of a motion search process, which increases the complexity of the decoder.

SUMMARY

An object of the embodiments of the present invention is to provide a method and device for generating a predicted value of an image, so as to lower the complexity of a decoder.

According to an embodiment of the present invention, there is provided a method for generating a predicted value of an image, which is used to generate a predicted value of a current block during image encoding, the method comprising the steps of: determining a searching scope, wherein multiple motion vectors are included in the searching scope; performing up-sampling interpolations on first reference blocks by using a first filter to obtain up-sampled first reference blocks, wherein the first reference blocks are the reference blocks in a reference image of the current block corresponding to the motion vectors in the searching scope; obtaining at least one candidate motion vector corresponding to the current block by using the up-sampled first reference blocks; performing up-sampling interpolations on second reference blocks by using a second filter to obtain up-sampled second reference blocks, wherein the second reference blocks are the reference blocks in the reference image of the current block corresponding to the at least one candidate motion vector; and combining the up-sampled second reference blocks to generate a predicted value of the current block.

According to another embodiment of the present invention, there is provided a device for generating a predicted value of an image, which is used to generate a predicted value of a current block during image encoding, the device comprising: a determining unit configured to determine a searching scope, wherein multiple motion vectors are included in the searching scope; a first up-sampling unit configured to perform up-sampling interpolations on first reference blocks by using a first filter to obtain up-sampled first reference blocks, wherein the first reference blocks are the reference blocks in a reference image of the current block corresponding to the motion vectors in the searching scope; a motion searching unit configured to obtain at least one candidate motion vector corresponding to the current block by using the up-sampled first reference blocks; a second up-sampling unit configured to perform up-sampling interpolations on second reference blocks by using a second filter to obtain up-sampled second reference blocks, wherein the second reference blocks are the reference blocks in the reference image of the current block corresponding to the at least one candidate motion vector; and a combining unit configured to combine the up-sampled second reference blocks to generate a predicted value of the current block.

The embodiments of the present invention lower the complexity of the encoder and decoder to a certain extent by using the known information around the current block to perform a predictive search; at the same time, a reference image with subpixel precision may be obtained by using two filters to perform interpolations on a reference image, improving the performance of the decoder.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to more clearly describe the technical solutions of the prior art or the embodiments of the present invention, the drawings to be used in the descriptions of the prior art or the embodiments are briefly introduced as follows. Obviously, the following drawings just illustrate some embodiments of the present invention, and a person skilled in the art can obtain other drawings from these drawings without creative effort.

FIG. 1 is a schematic diagram of obtaining a motion vector in the prior art;

FIG. 2 is a flowchart of a method for generating a predicted value of an image according to an embodiment of the present invention;

FIG. 3 is a schematic diagram of obtaining a candidate motion vector set of a current block;

FIG. 4 is a schematic diagram of obtaining a current template and a reference template of a current block;

FIG. 5 is a schematic diagram of performing up-sampling interpolations on a reference image by using a filter;

FIG. 6 is a flowchart of a method for generating a predicted value of an image according to another embodiment of the present invention;

FIG. 7 is a flowchart of a method for generating a predicted value of an image according to another embodiment of the present invention;

FIG. 8 is a schematic diagram of performing up-sampling interpolations on a reference image by using a simple filter;

FIG. 9 is a structure for implementing a method for generating a predicted value of an image according to another embodiment of the present invention;

FIG. 10 is a structure for implementing a method for generating a predicted value of an image according to another embodiment of the present invention;

FIG. 11 is a block diagram of a device for generating a predicted value of an image according to an embodiment of the present invention;

FIG. 12 is a block diagram of a motion searching unit according to an embodiment of the present invention; and

FIG. 13 is a block diagram of a motion searching unit according to another embodiment of the present invention.

DETAILED DESCRIPTION

The technical solutions of the embodiments of the present invention will be clearly and completely described as follows in conjunction with the drawings. Apparently, the described embodiments are just a part of the embodiments of the present invention rather than all the embodiments. Based on the embodiments of the present invention, any other embodiment obtained by a person skilled in the art without creative effort will fall within the protection scope of the present invention.

Referring to FIG. 2, a method for generating a predicted value of an image according to an embodiment of the present invention is shown, the method comprising the steps of:

S201: for a current block Curr_Blk to be encoded, obtaining a current template and a candidate motion vector set of the current block.

A template, for example, TM1 shown in FIG. 4, is constructed by using reconstructed pixels around the current block Curr_Blk. A candidate motion vector set is constructed by using known motion vectors around the current block, such as MVL, MVU and MVR shown in FIG. 3, and the motion vectors of the same positions in the previous and next frames, such as colMV1, colMV2 . . . colMV4, etc.

S202: selecting a candidate motion vector from the candidate motion vector set, and finding a reference template TM2 of the current template TM1 from a reference image by using the selected candidate motion vector.

As shown in FIG. 4, it is assumed that a reference template TM2 of the current template TM1 is found in the reference image by using a candidate motion vector MVL. If MVL is an integer pixel shift vector, TM2 is directly obtained by offsetting by MVL in the reference image.

If MVL is a subpixel shift vector, up-sampling is performed on the reference image; for example, first, up-sampling on an original reference image is performed by using a 6-tap filter [1, −5, 20, 20, −5, 1] or an adaptive interpolation filter (AIF) in H.264/AVC Standard to obtain the up-sampled reference image, and then the subpixel offset MVL is applied in the up-sampled reference image to obtain TM2.

In order to obtain a fine motion vector, a subpixel search may further be performed within a neighborhood at subpixel distances from the current candidate motion vector. For example, as shown in FIG. 4, position 0 indicates the position of the current candidate motion vector, and positions 1, 2, . . . , 8 indicate neighborhood positions at subpixel distances from the position indicated by the current candidate motion vector. If a search position is at a subpixel position of the reference image, such as a half pixel position, up-sampling on an original reference image is first performed by using a filter (such as a 6-tap filter [1, −5, 20, 20, −5, 1] in H.264/AVC Standard) to obtain the up-sampled reference image (the reference image here may not be a whole image, but image blocks of the reference image), and then the subpixel offset MVL is applied in the up-sampled reference image to obtain TM2.

S203: calculating a matching difference cost between TM2 and TM1.

In particular, a sum of absolute differences (SAD), a sum of absolute transformation differences, or a sum of absolute square differences may be used, and of course, some other parameters describing the similarity between two reference blocks may also be used.
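As a purely illustrative sketch (the 8-bit sample type, the row stride parameter and the function name sad_block are assumptions introduced here, not part of the described method), a SAD between two areas of the same size may be computed as follows:

#include <stdlib.h>

/* Sum of absolute differences between two areas of size width x height.
   cur and ref point to the top-left samples; stride is the distance in
   samples between vertically adjacent rows of the two buffers. */
static int sad_block(const unsigned char *cur, const unsigned char *ref,
                     int stride, int width, int height)
{
    int cost = 0;
    for (int y = 0; y < height; y++)
        for (int x = 0; x < width; x++)
            cost += abs(cur[y * stride + x] - ref[y * stride + x]);
    return cost;
}

The same loop with a squared difference in place of abs() gives the sum of absolute square differences mentioned above.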

A motion vector is selected from those unselected motion vectors in the candidate motion vector set, and steps S202 and S203 are repeatedly executed; in this manner, matching differences cost1, cost2, cost3 . . . between the current template and the reference template to which each candidate motion vector corresponds are obtained.

S204: sequencing the obtained matching differences cost, then selecting a motion vector with the minimum matching difference cost as the motion vector of the current block.

S205: selecting at least one motion vector with minimum cost, then finding the corresponding reference blocks in the reference image.

S206: calculating a predicted value of the current block according to the motion vector of the current block.

First, several (for example, 4) motion vectors with the minimum matching differences are selected, and the corresponding reference blocks are found in the reference image; if the motion vectors are subpixel motion vectors, up-sampling is then performed on the reference blocks by using the same filter as that used in the motion search (such as a 6-tap filter [1, −5, 20, 20, −5, 1] or an adaptive interpolation filter in H.264/AVC Standard), to thereby obtain the up-sampled blocks as predicted values of the current block; these four predicted values with the minimum matching differences are combined to generate the predicted value of the current block; for example, the four predicted values with the minimum matching differences are averaged as the predicted value of the current block, to thereby predict the current block.
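A minimal sketch of the combining step, under the assumption that the selected prediction blocks have already been fetched into an array of pointers (the names combine_predictions and pred are illustrative only); a rounded arithmetic average, as in the example above, is used:

/* Average n candidate prediction blocks into the final predicted value
   of the current block, with rounding. */
static void combine_predictions(unsigned char *dst, unsigned char *const pred[],
                                int n, int stride, int width, int height)
{
    for (int y = 0; y < height; y++)
        for (int x = 0; x < width; x++) {
            int sum = 0;
            for (int i = 0; i < n; i++)
                sum += pred[i][y * stride + x];
            dst[y * stride + x] = (unsigned char)((sum + n / 2) / n);
        }
}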

At the encoder, a residual value is obtained by subtracting the predicted value from the current original block; the residual value is transformed and quantized and is encoded into a code stream. At the decoder, the residual value is decoded, and a restored value of the current block is obtained by adding the predicted value to the residual value.

In this embodiment, it is possible to lower the complexity of a decoder to a certain extent by using the known motion vectors around the current block as the candidate motion vectors to perform a predictive search; however, the complexity of the encoder and decoder is still relatively high as compared with a conventional method in which motion information is transmitted in the code stream. After conducting an analysis, the inventor found that the up-sampling process in the motion search is one of the steps of the method that consume the most calculation resources.

Referring to FIG. 5, in order to further lower the complexity of the encoder and decoder, the present invention provides a method for generating a predicted value of an image according to another embodiment, which is used to generate a predicted value of a current block during image encoding, the method comprising the steps of:

S501: determining a searching scope, wherein multiple motion vectors are included in the searching scope.

The searching scope represents a motion vector set to be searched. There exist many approaches for determining a searching scope. According to one embodiment, a set of candidate motion vectors of the current block is obtained as the searching scope; according to another embodiment, a search starting point of the current template is obtained according to the motion vectors around the current block, and the searching scope is obtained according to the search starting point and a predefined searching region.

S502: performing up-sampling interpolations on first reference blocks by using a first filter to obtain up-sampled first reference blocks, wherein the first reference blocks are reference blocks in a reference image of the current block corresponding to the motion vectors in the searching scope.

The reference image refers to a restored image around the image to which the current block belongs, and is used for restoring the current block. The reference image may be a reference frame, a reference block, or a reference field picture.

The above step may be implemented in various forms. According to one embodiment, up-sampling interpolation could first be performed on the reference image by using a first filter such that a first reference image is obtained; and then the up-sampled first reference blocks corresponding to the motion vectors within the searching scope are obtained from the first reference image.

In another embodiment, first reference blocks corresponding to the motion vectors within the searching scope could first be obtained from a reference image; and then up-sampling interpolation is performed on the first reference blocks by using a first filter, to thereby obtain the up-sampled first reference blocks. Up-sampling processing, which is only performed on the reference image blocks at the positions indicated by the motion vectors, may lower the complexity of the algorithm and improve the efficiency.
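The following sketch illustrates this block-level variant: only the patch of the reference image that a motion vector points at, enlarged by the margin the interpolation filter needs, is copied out before up-sampling. The margin value, the integer-pel motion vector components, the function name and the absence of picture-boundary clamping are simplifying assumptions.

/* Copy the (width + 2*margin) x (height + 2*margin) patch of the reference
   image addressed by an integer-pel motion vector (mv_x, mv_y) for the block
   at (pos_x, pos_y), so that up-sampling interpolation can be performed on
   this patch alone instead of on the whole reference image.  The patch is
   assumed to lie inside the picture; boundary handling is omitted. */
static void fetch_reference_patch(const unsigned char *ref, int ref_stride,
                                  int pos_x, int pos_y, int mv_x, int mv_y,
                                  int width, int height, int margin,
                                  unsigned char *patch, int patch_stride)
{
    int x0 = pos_x + mv_x - margin;
    int y0 = pos_y + mv_y - margin;
    for (int y = 0; y < height + 2 * margin; y++)
        for (int x = 0; x < width + 2 * margin; x++)
            patch[y * patch_stride + x] = ref[(y0 + y) * ref_stride + (x0 + x)];
}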

S503: obtaining at least one candidate motion vector corresponding to the current block by using the up-sampled first reference blocks.

Such a step is a process of motion search. A motion search means calculating a matching difference between two blocks corresponding to each searching position in the up-sampled first reference blocks. If two reference images are selected for the current block, during the process of motion search, a matching difference between two up-sampled first reference blocks corresponding to the two reference images needs to be calculated; and if only one reference image is selected for the current block, during the process of motion search, a current template of the current block and a corresponding reference template in the up-sampled first reference blocks need to be obtained, and then a matching difference between the current template and the reference template is calculated. Of course, when two reference images are selected, a combination of the above two matching differences may also be used; that is, the sum of the matching difference between the two up-sampled first reference blocks corresponding to the two reference images and the matching difference between the current template and at least one corresponding reference template in the up-sampled first reference blocks is calculated.

The term “motion search” as used herein may be replaced with other terms in different places. For example, such technological terms as matching difference calculation and search matching are almost technologically equivalent to motion search, each of which represents a calculation of a matching difference between two blocks at each search position. A matching difference refers to a parameter of similarity between two reference blocks. In particular, SAD, the sum of absolute transformation differences, or the sum of absolute square differences may be used, and of course, some other parameters describing the similarity between two reference blocks may also be used.

At least one motion vector with the minimum matching difference may be selected as a candidate motion vector.

S504: performing up-sampling interpolations on second reference blocks by using a second filter to obtain up-sampled second reference blocks, wherein the second reference blocks are reference blocks in the reference image of the current block corresponding to the selected candidate motion vectors.

This step may be executed in various forms. In an embodiment, a second filter may first be used to perform up-sampling interpolations on the reference image, to obtain a second reference image; and then, from the second reference image, the up-sampled second reference blocks corresponding to the at least one candidate motion vector are obtained.

In another embodiment, second reference blocks corresponding to the at least one candidate motion vector may first be obtained from the reference image; and then the second filter is used to perform up-sampling interpolations on the second reference blocks, to obtain the up-sampled second reference blocks. Performing up-sampling only on the reference image at the positions indicated by the motion vectors may lower the complexity of the algorithm and improve the efficiency.

S505: combining the up-sampled second reference blocks to obtain a predicted value of the current block.

In still another embodiment, the tap number of the first filter is less than that of the second filter.

In the above embodiments, it is possible to lower the complexity of the decoder to a certain extent by using the surrounding known information to perform a predictive search, and at the same time, the use of two filters in performing interpolations on the reference signals can obtain a reference image with subpixel precision, which leads to an improvement of the performance of the decoder. Furthermore, during a motion search, the use of a filter that is simpler than the filter used in motion compensation may maintain performance while lowering the complexity of calculation. This meets the requirements on modularization of the design of decoder hardware, reduces the frequency of data reads, improves the efficiency of data reading, reduces the number of module switches, and allows the template techniques to be applied to macro block division. Moreover, in the process of up-sampling, performing up-sampling only on the reference image corresponding to the positions indicated by the motion vectors can lower the complexity of the algorithm and improve the efficiency.

Referring to FIG. 6, another embodiment of the method for generating a predicted value of an image provided in the present invention comprises the following steps:

S601: for a current block Curr_Blk to be encoded, obtaining a current template and a current candidate motion vector set.

The current block may be a whole macro block, or one of the divided blocks of a macro block. Here, a whole macro block is taken as an example.

The current template, for example, TM1 shown in FIG. 4, is obtained by using reconstructed pixels around the current block. The candidate motion vector set is constructed by using known motion vectors around the current block, such as MVL, MVU and MVR shown in FIG. 3, and the motion vectors of the same positions in the previous and next frames, such as colMV1, colMV2 . . . colMV4, etc.

The candidate motion vector set may comprise the motion vectors MVL, MVU and MVR of surrounding blocks spatially related to the current block Curr_Blk and their motion vector mid-value MEAN (MVL, MVU, MVR), the motion vectors of the blocks at the same position in a forward reference frame fn−1 and the motion vectors of the blocks surrounding those same-position blocks, and the motion vectors of the blocks at the same position in a backward reference frame fn+1 and all or part of the motion vectors of the blocks surrounding those same-position blocks; that is, it comprises the left block, the upper block, the left upper block and the right upper block of the current block, all or part of the left block, upper block, left upper block and right upper block of the blocks in the reference frames at the same position as the current block, and the blocks in the reference frames at the same position as the current block.
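A hedged sketch of how such a candidate set could be collected; the MotionVector type, the helper median3 and the calling convention are illustrative assumptions, and the spatial and temporal neighbour vectors are assumed to have been fetched by the caller:

typedef struct { int x; int y; } MotionVector;

/* Component-wise median of three values: sum minus minimum minus maximum. */
static int median3(int a, int b, int c)
{
    int min = a < b ? (a < c ? a : c) : (b < c ? b : c);
    int max = a > b ? (a > c ? a : c) : (b > c ? b : c);
    return a + b + c - min - max;
}

/* Collect candidate motion vectors: the left, upper and right-upper spatial
   neighbours, their component-wise mid-value MEAN(MVL, MVU, MVR), and the
   vectors col[] taken from the same-position blocks (and their neighbours)
   in the forward and backward reference frames. */
static int build_candidate_set(MotionVector mvl, MotionVector mvu, MotionVector mvr,
                               const MotionVector *col, int num_col,
                               MotionVector *cand, int max_cand)
{
    int n = 0;
    cand[n++] = mvl;
    cand[n++] = mvu;
    cand[n++] = mvr;
    cand[n].x = median3(mvl.x, mvu.x, mvr.x);
    cand[n].y = median3(mvl.y, mvu.y, mvr.y);
    n++;
    for (int i = 0; i < num_col && n < max_cand; i++)
        cand[n++] = col[i];
    return n;
}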

S602: obtaining reference images, and performing up-sampling on the reference images by using the first filter to obtain a first reference image with subpixel precision, wherein the first filter is a simple filter.

Reference images are obtained, and up-sampling is performed on the reference images by using a simple filter filter1, such as a bilinear filter having a filtering coefficient of [1,1] or a 3-tap filter [1,2,1], to obtain a first reference image with subpixel precision (which may be ½ pixel precision, or ¼ pixel precision, or even ⅛ pixel precision).

The detailed method is shown in FIG. 7, wherein A, B, C, D are selected whole pixel points of the reference image, b, h, j, m, s are ½ pixel points, and a, c, d, e, f, g, i, k, n, p, q, r are ¼ pixel points. First, interpolations are performed on the whole pixel points A, B, C, D of the reference image by using the simple filter filter1 so that the ½ pixel points b, h, j, m, s are obtained, constructing a first reference image of ½ pixel precision; next, interpolations are performed on the whole pixel points A, B, C, D and the ½ pixel points b, h, j, m, s by using the simple filter filter1 so that the ¼ pixel points a, c, d, e, f, g, i, k, n, p, q, r are obtained, constructing a first reference image of ¼ pixel precision. For example:

b=(A+B+1)/2;

s=(C+D+1)/2;

j=(b+s+1)/2;
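The C sketch below applies exactly these bilinear rules to one pair of integer-pel rows; the row-buffer layout and the function name are assumptions made only for illustration:

/* Half-pel up-sampling with the [1,1] bilinear rule of the example above:
   b = (A + B + 1)/2 and s = (C + D + 1)/2 along each row, and
   j = (b + s + 1)/2 between the two rows.  row0 and row1 are two vertically
   adjacent integer-pel rows of width samples. */
static void bilinear_half_pel(const unsigned char *row0, const unsigned char *row1,
                              int width, unsigned char *b_row, unsigned char *s_row,
                              unsigned char *j_row)
{
    for (int x = 0; x + 1 < width; x++) {
        b_row[x] = (unsigned char)((row0[x] + row0[x + 1] + 1) / 2);
        s_row[x] = (unsigned char)((row1[x] + row1[x + 1] + 1) / 2);
        j_row[x] = (unsigned char)((b_row[x] + s_row[x] + 1) / 2);
    }
}

Quarter-pel points are then obtained by applying the same rule once more to neighbouring whole-pel and half-pel values.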

S603: for each candidate motion vector, obtaining a reference template corresponding to the current template from the first reference image.

In the obtained first reference image, by using the candidate motion vector, the reference template TM2 corresponding to the position indicated by the candidate motion vector, or the reference template TM2 corresponding to a neighborhood position at a subpixel distance from the position indicated by the candidate motion vector, is found, as shown in FIG. 4.

S604: calculating matching differences between the current template and each reference template corresponding to each candidate motion vector, and selecting the motion vector corresponding to the reference template with the minimum matching difference as the motion vector of the current block.

The matching difference refers to a parameter for the similarity between the current template and the reference template. For example, SAD is used to calculate a sum of the absolute values of the pixel differences between the current block template area and the corresponding area of the reference template, or the sum of absolute square differences (SSD) may be used to calculate a sum of the squares of the pixel differences between the current block template area and the corresponding area of the reference template.

Steps S603 and S604 are repeated for each candidate motion vector in the candidate motion vector set to obtain matching differences cost1, cost2, cost3 . . . between the current template and the reference template to which each candidate motion vector corresponds.

The matching differences cost between TM2 and TM1 obtained through calculation are sequenced, and a motion vector corresponding to a reference template with the minimum matching difference is selected as the motion vector of the current block.

S605: performing up-sampling on the reference images by using a second filter, to obtain a second reference image with subpixel precision, wherein the tap number of the first filter is less than that of the second filter.

A motion compensating filter filter2, such as a 6-tap filter [1, −5, 20, 20, −5, 1] or an adaptive filter in H.264/AVC Standard, is used to perform interpolations, as shown in FIG. 3, to generate a second reference image. Referring to FIG. 8, a particular interpolation generating method is as follows:

First, ½ pixel points are generated by using the whole pixel points A, B, C, D, E, F, G, H, I, J, K, L, M, N, P, Q, R, S, T, U through the motion compensating filter filter2. Afterwards, ¼ pixel points are generated by using the whole pixel points and the ½ pixel points, to obtain the second reference image with subpixel precision.
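For comparison with the simple filter, the following sketch computes one half-pel sample with the 6-tap filter [1, −5, 20, 20, −5, 1], using the normalization (+16, shift by 5) and clipping that H.264/AVC applies; the sample naming p0..p5 for the six neighbouring whole pixel points is an assumption for illustration:

/* Clip an intermediate value to the 8-bit sample range. */
static unsigned char clip_pixel(int v)
{
    return (unsigned char)(v < 0 ? 0 : (v > 255 ? 255 : v));
}

/* Half-pel sample located between p2 and p3 of six consecutive whole pixel
   points p0..p5, filtered with [1, -5, 20, 20, -5, 1] and rounded. */
static unsigned char half_pel_6tap(int p0, int p1, int p2, int p3, int p4, int p5)
{
    int acc = p0 - 5 * p1 + 20 * p2 + 20 * p3 - 5 * p4 + p5;
    return clip_pixel((acc + 16) >> 5);
}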

S606: selecting at least one motion vector corresponding to at least one reference template with the minimum matching difference to obtain corresponding up-sampled second reference blocks from the second reference image respectively, and combining the obtained at least one up-sampled second reference block to obtain the predicted value of the current block.

The at least one motion vector corresponding to the minimum matching difference is the candidate motion vector.

Multiple motion vectors corresponding to the first several (for example, 4) minimum matching differences cost between TM2 and TM1 are selected, and the corresponding reference blocks trueRef are found from the second reference image. These reference blocks are combined to generate the predicted value pred of the current block. For example, an average value of the four predicted values with the minimum matching differences is taken as the predicted value of the current block, to predict the current block.

In another embodiment, in order to lower the complexity of processing, it is unnecessary to perform up-sampling on the whole reference image, and up-sampling may only be performed on the part of the reference image containing the content corresponding to the candidate motion vector. This embodiment comprises the following steps:

S6011: for a current block Curr_Blk to be encoded, obtaining a current template and current candidate motion vectors.

The detailed procedure for this step is similar to that of the last embodiment.

S6021: obtaining reference images, obtaining first reference blocks corresponding to the candidate motion vectors from the reference images, and performing up-sampling on the first reference blocks by using a first filter, to obtain up-sampled first reference blocks with subpixel precision, wherein the first filter is a simple filter.

Reference images are obtained. Then, the first reference blocks corresponding to the candidate motion vectors are obtained from the reference images, and up-sampling is performed on the first reference blocks by using a simple filter filter1, such as a bilinear filter having a filtering coefficient of [1,1] or a 3-tap filter [1,2,1], to obtain up-sampled first reference blocks with subpixel precision (which may be ½ pixel precision, or ¼ pixel precision, or even ⅛ pixel precision).

The detailed method is shown in FIG. 5, wherein A, B, C, D are selected whole pixel points of the first reference blocks, b, h, j, m, s are ½ pixel points, and a, c, d, e, f, g, i, k, n, p, q, r are ¼ pixel points. First, interpolations are performed on the whole pixel points A, B, C, D of the first reference blocks by using the simple filter filter1 so that the ½ pixel points b, h, j, m, s are obtained, constructing up-sampled first reference blocks of ½ pixel precision; next, interpolations are performed on the whole pixel points A, B, C, D and the ½ pixel points b, h, j, m, s by using the simple filter filter1 so that the ¼ pixel points a, c, d, e, f, g, i, k, n, p, q, r are obtained, constructing up-sampled first reference blocks of ¼ pixel precision. For example:

b=(A+B+1)/2;

s=(C+D+1)/2;

j=(b+s+1)/2;

S6031: for each candidate motion vector, obtaining a reference template corresponding to the current template from the up-sampled first reference blocks.

In the obtained up-sampled first reference blocks, by using the candidate motion vector, the reference template corresponding to the position indicated by the candidate motion vector, or the reference template TM2 corresponding to a neighborhood position at a subpixel distance from the position indicated by the candidate motion vector, is found, as shown in FIG. 4.

S6041: calculating matching differences between the current template and each reference template, and selecting the motion vector corresponding to the reference template with the minimum matching difference as the motion vector of the current block.

S6051: selecting at least one motion vector corresponding to at least one reference template with the minimum matching difference, and obtaining corresponding second reference blocks from the reference images respectively.

The at least one motion vector corresponding to the minimum matching difference is the candidate motion vector.

S6061: performing up-sampling on the second reference blocks by using a second filter to obtain up-sampled second reference blocks with subpixel precision, and combining the at least one obtained up-sampled second reference block to obtain the predicted value of the current block, wherein the tap number of the first filter is less than that of the second filter.

Interpolations are performed by using a motion compensating filter filter2, such as a 6-tap filter [1, −5, 20, 20, −5, 1] or an adaptive filter in H.264/AVC Standard, to generate the up-sampled second reference blocks. Referring to FIG. 8, a particular interpolation generating method is as follows:

generating ½ pixel points by using whole pixel points A, B, C, D, E, F, G, H, I, J, K, L, M, N, P, Q, R, S, T, U through the motion compensating filter filter2, and then generating ¼ pixel points by using the whole pixel points and the ½ pixel points, to obtain the second reference blocks with subpixel precision.

These up-sampled second reference blocks are combined to generate the predicted value pred of the current block. For example, an average value of the four up-sampled second reference blocks is taken as the predicted value of the current block, so as to predict the current block.

The method may be applicable to an encoder or to a decoder. For the encoder, after the predicted value of the current block is obtained, a residual value between the current block and the predicted value is calculated, and then transmitted to the decoder after being encoded. For the decoder, a restored value of the current block is obtained by adding the residual value of the current block obtained through decoding to the combined predicted value pred of the current block.
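A minimal sketch of this residual handling (transform, quantization and entropy coding are omitted; the function names are illustrative assumptions):

/* Encoder side: residual = original block - predicted value. */
static void compute_residual(const unsigned char *org, const unsigned char *pred,
                             int *resid, int num_samples)
{
    for (int i = 0; i < num_samples; i++)
        resid[i] = org[i] - pred[i];
}

/* Decoder side: restored value = predicted value + decoded residual,
   clipped to the valid sample range. */
static void reconstruct_block(const unsigned char *pred, const int *resid,
                              unsigned char *rec, int num_samples)
{
    for (int i = 0; i < num_samples; i++) {
        int v = pred[i] + resid[i];
        rec[i] = (unsigned char)(v < 0 ? 0 : (v > 255 ? 255 : v));
    }
}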

In accordance with this embodiment, during a motion search, the use of a filter that is simpler than the filter used in motion compensation may maintain performance and lower the complexity of calculation. Also, it is possible to lower the complexity without compromising the precision, which is achieved by obtaining the candidate motion vector required for the motion compensation process using the matching difference between the current template and the reference template during the motion search process, and by performing a predictive search using the surrounding known information. Moreover, in the process of up-sampling, up-sampling is performed only on the reference blocks corresponding to the positions indicated by the motion vectors, which leads to a simpler algorithm and higher efficiency.

Referring to FIG. 9, another embodiment of the method for generating a predicted value of an image provided in the present invention comprises the following steps:

S901: for a current block Curr_Blk to be encoded, obtaining a current template, and obtaining a search starting point of the current template according to motion vectors around the current block.

The current template, for example, TM1 as shown in FIG. 4, is obtained by using reconstructed pixels around the current block. A search starting point of the current template is then obtained by using known motion vectors around the current block, such as MVL, MVU and MVR shown in FIG. 3.

S902: obtaining reference images, and performing up-sampling on the reference images by using a first filter to obtain a first reference image with subpixel precision, wherein the first filter is a simple filter.

The reference images are obtained, and a first reference image with subpixel precision (which may be ½ pixel precision, or ¼ pixel precision, or even ⅛ pixel precision) is obtained by performing up-sampling on the reference images using a simple filter filter1, such as a bilinear filter having a filtering coefficient of [1,1]. The detailed method is shown in FIG. 7, wherein A, B, C, D are selected whole pixel points of the reference image, b, h, j, m, s are ½ pixel points, and a, c, d, e, f, g, i, k, n, p, q, r are ¼ pixel points. First, interpolations are performed on the whole pixel points A, B, C, D of the reference image by using the simple filter filter1 so that the ½ pixel points b, h, j, m, s are obtained, constructing a first reference image of ½ pixel precision; next, interpolations are performed on the whole pixel points A, B, C, D and the ½ pixel points b, h, j, m, s by using the simple filter filter1 so that the ¼ pixel points a, c, d, e, f, g, i, k, n, p, q, r are obtained, constructing a first reference image of ¼ pixel precision. For example:

b=(A+B+1)/2;

s=(C+D+1)/2;

j=(b+s+1)/2;

S903: determining a searching scope, and for each searching point in the searching scope, obtaining a reference template corresponding to the current template from the first reference image.

The searching scope is determined according to the search starting point and a predefined searching region, and a reference template TM2 corresponding to each searching point in the searching scope is found from the obtained first reference image. The searching region is determined through negotiation between an encoder and a decoder. According to one example, the encoder determines the searching region and transmits the searching region to the decoder; in another example, the encoder and the decoder are configured with an identical searching region in advance.
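As an illustrative sketch only (the square window shape, the subpixel unit of the vectors and the MotionVector type are assumptions), the searching scope around the starting point can be enumerated as follows:

typedef struct { int x; int y; } MotionVector;

/* Enumerate every search position inside a square window of +/- range
   (in subpixel units of the up-sampled first reference image) around the
   search starting point; the resulting set of motion vectors is the
   searching scope. */
static int build_search_scope(MotionVector start, int range,
                              MotionVector *scope, int max_scope)
{
    int n = 0;
    for (int dy = -range; dy <= range; dy++)
        for (int dx = -range; dx <= range; dx++) {
            if (n >= max_scope)
                return n;
            scope[n].x = start.x + dx;
            scope[n].y = start.y + dy;
            n++;
        }
    return n;
}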

S904: calculating matching differences between the current template and each reference template, and selecting the motion vector corresponding to the reference template with the minimum matching difference as the motion vector of the current block.

In particular, the sum of absolute differences (SAD), the sum of absolute transformation differences, or the sum of absolute square differences may be used, and of course, some other parameters describing the similarity between two reference blocks may also be used.

The matching differences cost between TM2 and TM1 obtained through calculation are sequenced, and a motion vector corresponding to a reference template with the minimum matching difference is selected as the motion vector of the current block.

S905: performing up-sampling on the reference images by using the second filter to obtain a second reference image with subpixel precision, wherein the tap number of the first filter is less than that of the second filter.

Interpolations are performed by using a motion compensating filter filter2, such as a 6-tap filter [1, −5, 20, 20, −5, 1] or an adaptive filter in H.264/AVC Standard, to generate true reference blocks trueRef. A particular interpolation generating method is as follows:

generating ½ pixel points by using whole pixel points A, B, C, D, E, F, G, H, I, J, K, L, M, N, P, Q, R, S, T, U through the motion compensating filter filter2, then generating ¼ pixel points by using the whole pixel points and the ½ pixel points, and finally obtaining the reference blocks with subpixel precision by using the whole pixel points, the ½ pixel points as well as the ¼ pixel points.

S906: selecting multiple motion vectors corresponding to reference templates with the minimum matching difference to obtain corresponding reference blocks from the second reference image respectively, and combining the obtained reference blocks to generate the predicted value of the current block.

At least one motion vector corresponding to the minimum matching difference is the candidate motion vector. Motion vectors corresponding to the first several (for example, 4) minimum matching differences cost between TM2 and TM1 are selected, and the corresponding reference blocks trueRef are found from the second reference image. These reference blocks are combined to generate the predicted value pred of the current block. For example, an average value of the four predicted values with the minimum matching differences is taken as the predicted value of the current block, so as to predict the current block.

In order to lower the complexity of processing, it is unnecessary to perform up-sampling on the whole reference image, and up-sampling may only be performed on the part of the reference image containing the content corresponding to the candidate motion vector.

In another embodiment, the steps S902 and S903 may be substituted by the following steps:

S9021: obtaining first reference blocks corresponding to the candidate motion vectors from the reference images, and performing up-sampling interpolations on the first reference blocks by using the first filter to obtain the up-sampled first reference blocks.

The detailed up-sampling process is similar to the step S902.

S9031: for each candidate motion vector, obtaining a reference template corresponding to the current template from the up-sampled first reference blocks.

The detailed process is similar to that of the step S903.

In another embodiment, the steps S905 and S906 may be substituted by the following steps:

S9051: selecting at least one candidate motion vector corresponding to at least one reference template with the minimum matching difference, obtaining second reference blocks corresponding to the candidate motion vectors from the reference images, and performing up-sampling interpolations on the second reference blocks by using a second filter to obtain the up-sampled second reference blocks.

The detailed up-sampling process is similar to the step S905.

S9061: combining the obtained at least one up-sampled second reference block to generate the predicted value of the current block.

The detailed combining process is similar to that of the step S906.

The method may be applicable to an encoder or to a decoder. For the encoder, after the predicted value of the current block is obtained, a residual value between the current block and the predicted value is calculated, and then transmitted to the decoder after being encoded. For the decoder, a restored value of the current block is obtained by adding the residual value of the current block obtained through decoding to the combined predicted value pred of the current block.

In accordance with this embodiment, during a motion search, the use of a filter that is simpler than the filter used in motion compensation may maintain performance and lower the complexity of calculation. Also, it is possible to lower the complexity without compromising the precision, which is achieved by obtaining the candidate motion vector required for the motion compensation process using the matching difference between the current template and the reference template during the motion search process, and by performing a predictive search using the surrounding known information. Moreover, in the process of up-sampling, up-sampling is performed only on the reference blocks corresponding to the positions indicated by the motion vectors, which leads to a simpler algorithm and higher efficiency.

Referring to FIG. 10, the method for generating a predicted value of an image provided in the present invention comprises the following steps:

S1001: for a current block Curr_Blk needed to be encoded, obtaining a current template and current candidate motion vectors.

The current template, for example, TM1 as shown in FIG. 4, is obtained by using reconstructed pixels around the current block. Candidate motion vectors are constituted by using known motion vectors around the current block, such as MVL, MVU and MVR shown in FIG. 3, and the motion vectors of the same positions in the previous and next frames, such as colMV1, colMV2 . . . colMV4, etc.

S1002: obtaining two reference images, and performing up-sampling on the reference images by using a first filter to obtain a first reference image 1 and a first reference image 2 with subpixel precision, wherein the first filter is a simple filter.

Two reference images of the current block are obtained, namely, a forward reference image and a backward reference image. Up-sampling is performed on these two reference images by using a simple filter filter1, such as a bilinear filter having a filtering coefficient of [1,1], to obtain a virtual reference image with subpixel precision (which may be ½ pixel precision, or ¼ pixel precision, or even ⅛ pixel precision). The detailed up-sampling interpolation method is shown in FIG. 8, wherein A, B, C, D are selected whole pixel points of the reference image, b, h, j, m, s are ½ pixel points, and a, c, d, e, f, g, i, k, n, p, q, r are ¼ pixel points. First, interpolations are performed on the whole pixel points A, B, C, D of the reference image by using the simple filter filter1 so that the ½ pixel points b, h, j, m, s are obtained, constructing a first reference image of ½ pixel precision; next, interpolations are performed on the whole pixel points A, B, C, D and the ½ pixel points b, h, j, m, s by using the simple filter filter1 so that the ¼ pixel points a, c, d, e, f, g, i, k, n, p, q, r are obtained, constructing a first reference image of ¼ pixel precision. In this way, a first reference image 1 virtualRef1 is obtained. A first reference image 2 virtualRef2 is obtained in the same manner.

S1003: for each candidate motion vector, obtaining an up-sampled first reference block 1 and an up-sampled first reference block 2 from the first reference image 1 and the first reference image 2, respectively.

For each candidate motion vector, an up-sampled first reference block 1 and an up-sampled first reference block 2 are obtained from the first reference image 1 and the first reference image 2, respectively. The up-sampled first reference block 1 and first reference block 2 corresponding to each candidate motion vector constitute a group of up-sampled first reference blocks.

From the obtained first reference image 1 virtualRef1, by using a candidate motion vector, the up-sampled first reference block 1 corresponding to the position indicated by the candidate motion vector, or an up-sampled first reference block 1 virBlk1 corresponding to a neighborhood position at a subpixel distance from the position indicated by the candidate motion vector, is found. Further, a first reference block 2 virBlk2 is obtained at the symmetrical position in the other reference image virtualRef2. A matching difference cost between the first reference block 1 virBlk1 and the first reference block 2 virBlk2 is then calculated. In particular, SAD, the sum of absolute transformation differences, or the sum of absolute square differences may be used, and of course, some other parameters describing the similarity between two reference blocks may also be used.
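A hedged sketch of this symmetric matching: for a candidate vector (u, v), the block at the indicated position in the up-sampled forward reference virtualRef1 is compared against the block at the mirrored position (−u, −v) in the up-sampled backward reference virtualRef2. Equal temporal distances, a common buffer stride and the function name are simplifying assumptions.

#include <stdlib.h>

/* SAD between the up-sampled first reference block 1 (at offset (u, v) in
   the forward reference) and the up-sampled first reference block 2 (at the
   symmetrical offset (-u, -v) in the backward reference).  pos_x/pos_y give
   the position of the current block in the up-sampled sample grid. */
static int bilateral_cost(const unsigned char *fwd, const unsigned char *bwd,
                          int stride, int pos_x, int pos_y, int u, int v,
                          int width, int height)
{
    const unsigned char *blk1 = fwd + (pos_y + v) * stride + (pos_x + u);
    const unsigned char *blk2 = bwd + (pos_y - v) * stride + (pos_x - u);
    int cost = 0;
    for (int y = 0; y < height; y++)
        for (int x = 0; x < width; x++)
            cost += abs(blk1[y * stride + x] - blk2[y * stride + x]);
    return cost;
}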

Moreover, the current template of the current block and the reference templates in the first reference images may also be used: the matching difference between the two reference blocks and the matching difference between the current template and the reference template may both be calculated, and an overall matching difference cost is obtained by calculating the sum of the two matching differences.

S1004: calculating matching differences between the up-sampled first reference blocks in each group, and selecting the motion vector with the minimum matching difference as the motion vector of the current block.

The matching differences between the up-sampled first reference blocks obtained through calculation are sequenced, and the motion vector corresponding to the group with the minimum matching difference is selected as the motion vector of the current block.

S1005: performing up-sampling on the reference images by using a second filter to obtain second reference images with subpixel precision, wherein the tap number of the first filter is less than that of the second filter.

True reference blocks trueRef are generated by performing interpolations using a motion compensating filter filter2, such as a 6-tap filter [1, −5, 20, 20, −5, 1] in H.264/AVC Standard. A particular interpolation generating method is as follows:

generating ½ pixel points through the motion compensating filter filter2 by using whole pixel points A, B, C, D, E, F, G, H, I, J, K, L, M, N, P, Q, R, S, T, U, then generating ¼ pixel points by using the whole pixel points and the ½ pixel points, and finally obtaining reference blocks with subpixel precision by using the whole pixel points, the ½ pixel points as well as the ¼ pixel points.

S1006: selecting a motion vector corresponding to a group of up-sampled first reference blocks with the minimum matching difference, obtaining corresponding up-sampled second reference blocks from the two second reference images respectively, and combining the obtained multiple up-sampled second reference blocks to generate the predicted value of the current block.

The at least one motion vector corresponding to the minimum matching difference is the candidate motion vector. The motion vector corresponding to the minimum matching difference cost between virBlk1 and virBlk2 is selected, and the corresponding up-sampled second reference blocks in fn−1 and fn+1 are found from the second reference images and combined to generate the predicted value pred.

In a case where the distance from the forward predicted frame fn−1 to the current frame fn is equal to the distance from the backward predicted frame fn+1 to the current frame fn, the predicted value of the current block curr_blk may be obtained by equation (1) below:

$f_n(x,y) = \{f_{n-1}(x - \tfrac{1}{2}u,\, y - \tfrac{1}{2}v) + f_{n+1}(x + \tfrac{1}{2}u,\, y + \tfrac{1}{2}v)\}/2 \qquad (1)$

where u and v are the horizontal component and the vertical component of the motion vector, respectively, and x and y are the horizontal coordinate position and the vertical coordinate position of the current block, respectively.

In a case where the distance from the forward predicted frame fn−1 to the current frame fn is not equal to the distance from the backward predicted frame fn+1 to the current frame fn, the predicted value of the current block curr_blk may be obtained by equation (2) below:

$f_n(x,y) = \left\{ d1 \cdot f_{n+1}\left(x + \tfrac{d2}{d1+d2}\,u,\; y + \tfrac{d2}{d1+d2}\,v\right) + d2 \cdot f_{n-1}\left(x - \tfrac{d1}{d1+d2}\,u,\; y - \tfrac{d1}{d1+d2}\,v\right) \right\} / (d1 + d2) \qquad (2)$

where d1 is the temporal distance from fn−1 to fn, and d2 is the temporal distance from fn+1 to fn; u and v are the horizontal component and the vertical component of the motion vector, respectively, and x and y are the horizontal coordinate position and the vertical coordinate position of the current block, respectively.
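The sketch below combines the two motion-compensated prediction blocks according to equations (1) and (2); it assumes both blocks have already been fetched onto the current block grid, so only the distance-dependent weighting remains, and rounding is added for illustration:

/* Combine forward (fn-1) and backward (fn+1) prediction samples.  With
   d1 == d2 this reduces to the plain average of equation (1); otherwise the
   distance-weighted combination of equation (2) is applied. */
static void weighted_bi_prediction(const unsigned char *fwd, const unsigned char *bwd,
                                   unsigned char *pred, int num_samples,
                                   int d1, int d2)
{
    for (int i = 0; i < num_samples; i++) {
        int v = (d2 * fwd[i] + d1 * bwd[i] + (d1 + d2) / 2) / (d1 + d2);
        pred[i] = (unsigned char)v;
    }
}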

In another embodiment, in order to lower the complexity of processing, it is unnecessary to perform up-sampling on the whole reference image, and up-sampling may only be performed on the part of the reference image containing the content corresponding to the candidate motion vector. This method comprises:

S1001′: for a current block Curr_Blk needed to be encoded, obtaining a current template and current candidate motion vectors.

S1002′: obtaining two reference images, and for each candidate motion vector, obtaining a first reference image 1 and a first reference image 2 from the two reference images.

S1003′: performing up-sampling on the first reference image 1 and first reference image 2 by using a first filter, to obtain an up-sampled first reference block 1 and an up-sampled first reference block 2, wherein the first filter is a simple filter.

The up-sampled first reference block 1 and up-sampled first reference block 2 corresponding to each candidate motion vector constitute a group of up-sampled first reference blocks.

S1004′: calculating matching differences between the up-sampled first reference blocks in each group, and selecting the motion vector corresponding to the minimum matching difference as the motion vector of the current block.

The matching differences between the up-sampled first reference blocks obtained through calculation are sequenced, and the motion vector corresponding to the minimum matching difference is selected as the motion vector of the current block.

S1005′: selecting the motion vector corresponding to a group of up-sampled first reference blocks with the minimum matching difference, obtaining corresponding second reference blocks respectively from the two reference images, and performing up-sampling on the second reference blocks by using a second filter to obtain two up-sampled second reference blocks, wherein the tap number of the first filter is less than that of the second filter.

And the motion vector corresponding to the minimum matching difference is the candidate motion vector.

Interpolations are performed by using a motion compensating filter filter2, such as a 6-tap filter [1, −5, 20, 20, −5, 1] in H.264/AVC Standard, to generate true reference blocks trueRef, as shown in FIG. 3. A particular interpolation generating method is as follows:

generating ½ pixel points by using whole pixel points A, B, C, D, E, F, G, H, I, J, K, L, M, N, P, Q, R, S, T, U through the motion compensating filter filter2, and then generating ¼ pixel points by using the whole pixel points and the ½ pixel points, so as to obtain reference blocks with subpixel precision.

S1006′: combining the obtained multiple up-sampled second reference blocks to generate the predicted value of the current block.

A motion vector corresponding to the minimum matching difference cost between virBlk1 and virBlk2 is selected, the corresponding second reference blocks are found from the reference images, and the second reference blocks are up-sampled to obtain the up-sampled second reference blocks fn−1 and fn+1, which are then combined to generate the predicted value pred.

The method may be applicable to an encoder or to a decoder. For the encoder, after the predicted value of the current block is obtained, a residual value between the current block and the predicted value is calculated, and then transmitted to the decoder after being encoded. For the decoder, a restored value of the current block is obtained by adding the residual value of the current block obtained through decoding to the combined predicted value pred of the current block.

In accordance with this embodiment, during a motion search, the use of a filter that is simpler than the filter used in motion compensation may maintain performance and lower the complexity of calculation. Also, it is possible to lower the complexity without compromising the precision, which is achieved by obtaining the candidate motion vector required for the motion compensation process using the matching difference between the current template and the reference template during the motion search process, and by performing a predictive search using the surrounding known information. Moreover, in the process of up-sampling, up-sampling is performed only on the reference blocks corresponding to the positions indicated by the motion vectors, which leads to a simpler algorithm and higher efficiency.

With reference to FIG. 11, an embodiment of the device for generating a predicted value of an image provided by the present invention is shown, the device comprising: a determining unit 1101, configured to determine a searching scope, wherein multiple motion vectors are included in the searching scope; a first up-sampling unit 1102, configured to perform up-sampling interpolations on first reference blocks by using a first filter to obtain up-sampled first reference blocks, wherein the first reference blocks are reference blocks in a reference image of the current block corresponding to the motion vectors in the searching scope; a motion searching unit 1103, configured to obtain at least one candidate motion vector corresponding to the current block by using the up-sampled first reference blocks; a second up-sampling unit 1104, configured to perform up-sampling interpolations on second reference blocks by using a second filter to obtain up-sampled second reference blocks, wherein the second reference blocks are reference blocks in the reference image of the current block corresponding to the at least one candidate motion vector; and a combining unit 1105, configured to combine the up-sampled second reference blocks to generate a predicted value of the current block.

In another embodiment, the tap number of the first filter is less than that of the second filter. In particular, for example, the first filter is a bilinear filter, and the second filter is a 6-tap filter or an adaptive filter.

In an embodiment, the determining unit is configured to determine a candidate motion vector set of the current block, and to take the determined candidate motion vector set as the searching scope. In another embodiment, the determining unit comprises: a search starting point determining unit configured to obtain a search starting point of a current template; and an obtaining unit configured to obtain the searching scope according to the search starting point and a predefined searching region.

With reference to FIG. 12, an embodiment of the motion searching unit comprises: an information obtaining unit 1201, configured to obtain a current template of the current block; a template obtaining unit 1202, configured to, for each candidate motion vector, obtain a reference template corresponding to the current template from the up-sampled first reference blocks; a calculating unit 1203, configured to calculate matching differences between the current template and each reference template, respectively; and a determining unit 1204, configured to select at least one motion vector corresponding to at least one reference template with the minimum matching difference, and to take the selected at least one motion vector as the candidate motion vector of the current block.

Another embodiment of the motion searching unit comprises: an information obtaining unit configured to obtain a current template of the current block; a template obtaining unit configured to, for each searching point in the searching scope, obtain a reference template corresponding to the current template from the up-sampled first reference blocks; a calculating unit configured to calculate matching differences between the current template and each reference template, respectively; and a determining unit configured to select at least one motion vector corresponding to at least one reference template with the minimum matching difference, and to take the selected at least one motion vector as the candidate motion vector of the current block.

With reference to FIG. 13, still another embodiment of the motion searching unit comprises: a reference block obtaining unit 1301 configured to, for each candidate motion vector, obtain two up-sampled first reference blocks; a calculating unit 1302 configured to calculate a matching difference of each group of the up-sampled first reference blocks; and a determining unit 1303 configured to select a motion vector corresponding to the up-sampled first reference blocks with a minimum matching difference, and to take the selected motion vector as the candidate motion vector of the current block.
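
By way of illustration only, the following C++ sketch shows one possible form of this variant, assuming two reference lists (a forward and a backward reference image), SAD as the matching criterion, equally sized blocks, and a non-empty candidate list; the BlockPair type and the function names are hypothetical.

    #include <cstdint>
    #include <cstdlib>
    #include <limits>
    #include <vector>

    // One group: the two up-sampled first reference blocks (one per reference
    // list) that a candidate motion vector points to.
    struct BlockPair {
        int mv_index;                       // index of the candidate motion vector
        std::vector<uint8_t> forward_ref;   // up-sampled first reference block, list 0
        std::vector<uint8_t> backward_ref;  // up-sampled first reference block, list 1
    };

    // Select the candidate whose pair of up-sampled first reference blocks has
    // the smallest matching difference (SAD between the two blocks themselves).
    int select_candidate(const std::vector<BlockPair>& pairs) {
        int best_index = pairs.front().mv_index;
        int best_sad = std::numeric_limits<int>::max();
        for (const BlockPair& p : pairs) {
            int sad = 0;
            for (size_t i = 0; i < p.forward_ref.size(); ++i)
                sad += std::abs(static_cast<int>(p.forward_ref[i]) -
                                static_cast<int>(p.backward_ref[i]));
            if (sad < best_sad) { best_sad = sad; best_index = p.mv_index; }
        }
        return best_index;
    }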

The device may be applied to a decoder or an encoder. When applied to an encoder, the device further comprises: an encoding unit configured to calculate a residual value between the current block and the predicted value, and to transmit the residual value to a decoder after encoding.

When applied to a decoder, the device further comprises: a decoding unit configured to decode the obtained residual value of the current block, and to obtain a restored value of the current block by adding the residual value of the current block to the predicted value of the current block.
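
By way of illustration only, the following C++ sketch shows the residual handling on both sides, with the entropy encoding and decoding steps omitted; the function names are hypothetical and 8-bit samples are assumed.

    #include <algorithm>
    #include <cstdint>
    #include <vector>

    // Encoder side: the residual value is the original block minus the predicted value.
    std::vector<int16_t> compute_residual(const std::vector<uint8_t>& original,
                                          const std::vector<uint8_t>& predicted) {
        std::vector<int16_t> residual(original.size());
        for (size_t i = 0; i < original.size(); ++i)
            residual[i] = static_cast<int16_t>(original[i] - predicted[i]);
        return residual;
    }

    // Decoder side: the restored value is the predicted value plus the decoded
    // residual value, clipped to the valid sample range.
    std::vector<uint8_t> restore_block(const std::vector<uint8_t>& predicted,
                                       const std::vector<int16_t>& residual) {
        std::vector<uint8_t> restored(predicted.size());
        for (size_t i = 0; i < predicted.size(); ++i)
            restored[i] = static_cast<uint8_t>(std::clamp(predicted[i] + residual[i], 0, 255));
        return restored;
    }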

The first up-sampling unit and second up-sampling unit may perform up-sampling on the whole reference image, and then obtain up-sampled reference blocks corresponding to the motion vectors in the searching scope; alternatively, the first up-sampling unit and second up-sampling unit may also find reference blocks corresponding to the motion vectors in the searching scope, and then obtain up-sampled reference blocks by performing up-sampling on the reference blocks.

The implementation details of the method embodiments described above are also applicable to the device embodiments.

The embodiments of the method and device may be applicable to an encoder or a decoder. According to these embodiments, during a motion search, a filter that is simpler than the filter used in the motion compensation process can maintain the prediction performance while lowering the complexity of calculation. In the process of motion search, the matching difference between two reference blocks corresponding to the reference images is used to obtain the candidate motion vector needed in the process of motion compensation, and a predictive search is performed by using the known information around the current block, lowering the complexity without compromising the accuracy. Furthermore, in the process of up-sampling, up-sampling is performed only on the reference blocks corresponding to the positions indicated by the motion vectors, which can lower the complexity of the algorithm and improve the efficiency of processing.

The various illustrative logical blocks, modules, and circuits described in connection with the embodiments disclosed herein may be implemented or performed with a general purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. The general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. The processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.

It may be understood by a person of ordinary skill in the art that all or part of the processes of the embodiments of the method may be implemented by related hardware under the instructions of a computer program stored in a computer-readable storage medium. The execution of the program may comprise the processes of the above embodiments of the method. The storage medium may be a floppy disk, a compact disc (CD), a read-only memory (ROM), a random access memory (RAM), or the like.

The method and device of the embodiments of the present invention may be applicable to various electronic devices or apparatuses related to them, for example, a mobile phone, a wireless device, a personal digital assistant (PDA), a hand-held or portable computer, a global positioning system (GPS) receiver/navigator, a still camera, an audio/video player, a video camera, a video recorder, or a monitoring device.

What are described above are merely embodiments of the present invention, and those skilled in the art may make various modifications or variations to the present invention according to the disclosure of this application without departing from the spirit and scope of the present invention.

What is claimed is:
1. A method for generating a predicted value of an image comprising: determining a searching scope, wherein multiple motion vectors are included in the searching scope; performing up-sampling interpolations on first reference blocks by using a first filter to obtain up-sampled first reference blocks, wherein the first reference blocks comprise reference blocks in a reference image of a current block corresponding to the motion vector in the searching scope; obtaining at least one candidate motion vector corresponding to the current block by using the up-sampled first reference blocks; performing up-sampling interpolations on second reference blocks by using a second filter to obtain up-sampled second reference blocks, wherein the second reference blocks comprise reference blocks in the reference image of the current block corresponding to the at least one candidate motion vector; and combining the up-sampled second reference blocks to generate a predicted value of the current block during image encoding or decoding.
2. The method according to claim 1, wherein a tap number of the first filter is less than a tap number of the second filter.
3. The method according to claim 2, wherein the first filter is a bilinear filter, and wherein the second filter is a 6-tap filter or an adaptive filter.
4. The method according to claim 1, wherein determining a searching scope comprises obtaining a candidate motion vector set of the current block as the searching scope.
5. The method according to claim 1, wherein determining a searching scope comprises: obtaining a search starting point of a current template according to motion vectors around the current block; and obtaining the searching scope according to the search starting point and a predefined searching region.
6. The method according to claim 1, wherein performing up-sampling interpolations on first reference blocks by using a first filter to obtain up-sampled first reference blocks comprises: performing up-sampling interpolations on the reference image by using the first filter to obtain a first reference image; and obtaining, from the first reference image, the up-sampled first reference blocks corresponding to the motion vectors in the searching scope.
7. The method according to claim 1, wherein performing up-sampling interpolations on first reference blocks by using a first filter to obtain up-sampled first reference blocks comprises: obtaining, from the reference image, first reference blocks corresponding to the motion vectors in the searching scope; and performing up-sampling interpolations on the first reference blocks by using the first filter to obtain the up-sampled first reference blocks.
8. The method according to claim 4, wherein obtaining at least one candidate motion vector corresponding to the current block by using the up-sampled first reference blocks comprises: obtaining a current template; obtaining, for each motion vector in the searching scope, a reference template to which the current template corresponds, from the up-sampled first reference blocks; calculating respectively matching differences between the current template and each reference template; and selecting at least one motion vector corresponding to at least one reference template with minimum matching difference as the candidate motion vector corresponding to the current block.
9. The method according to claim 5, wherein obtaining at least one candidate motion vector corresponding to the current block by using the up-sampled first reference blocks comprises: obtaining a current template; obtaining, for each motion vector in the searching scope, a reference template to which the current template corresponds, from the up-sampled first reference blocks; calculating respectively matching differences between the current template and each reference template; and selecting at least one motion vector corresponding to at least one reference template with minimum matching difference as the candidate motion vector corresponding to the current block.
10. The method according to claim 4, wherein obtaining at least one candidate motion vector corresponding to the current block by using the up-sampled first reference blocks comprises: calculating, for each candidate motion vector, matching difference between up-sampled first reference blocks in a corresponding group, wherein the number of the reference images is 2, and the number of the up-sampled first reference blocks is also 2; and selecting a motion vector corresponding to a group of up-sampled first reference blocks with minimum matching difference as the candidate motion vector corresponding to the current block.
11. The method according to claim 4, wherein obtaining at least one candidate motion vector corresponding to the current block by using the up-sampled first reference blocks comprises: calculating, for each candidate motion vector, matching difference between the up-sampled first reference blocks in a corresponding group, wherein the number of the reference images is 2, and the number of the up-sampled first reference blocks is also 2; calculating, for each candidate motion vector, matching difference between the current template and a reference template corresponding to the current template and obtained from the at least one up-sampled first reference block; and selecting a motion vector corresponding to a group of the up-sampled first reference blocks for which the sum of the two matching differences is minimum, and taking the selected motion vector as the candidate motion vector corresponding to the current block.
12. The method according to claim 1, wherein performing up-sampling interpolations on second reference blocks by using a second filter to obtain up-sampled second reference blocks comprises: performing up-sampling interpolations on the reference image by using the second filter to obtain a second reference image; obtaining, from the second reference image, the up-sampled second reference blocks corresponding to the candidate motion vector, and wherein the second reference blocks comprise reference blocks in the reference image of the current block corresponding to the at least one candidate motion vector.
13. The method according to claim 1, wherein performing up-sampling interpolations on second reference blocks by using a second filter to obtain up-sampled second reference blocks comprises: obtaining, from the reference image, second reference blocks corresponding to the candidate motion vector; performing up-sampling interpolations on the second reference blocks by using the second filter to obtain the up-sampled second reference blocks, and wherein the second reference blocks comprise reference blocks in the reference image of the current block corresponding to the at least one candidate motion vector.
14. A device for generating a predicted value of an image comprising: a determining unit configured to determine a searching scope, wherein multiple motion vectors are included in the searching scope; a first up-sampling unit configured to perform up-sampling interpolations on first reference blocks by using a first filter to obtain up-sampled first reference blocks, wherein the first reference blocks comprise reference blocks in a reference image of a current block corresponding to the motion vector in the searching scope; a motion searching unit configured to obtain at least one candidate motion vector corresponding to the current block by using the up-sampled first reference blocks; a second up-sampling unit configured to perform up-sampling interpolations on second reference blocks by using a second filter to obtain up-sampled second reference blocks, wherein the second reference blocks comprise reference blocks in the reference image of the current block corresponding to the at least one candidate motion vector; and a combining unit configured to combine the up-sampled second reference blocks to generate a predicted value of the current block during image encoding.
15. The device according to claim 14, wherein a tap number of the first filter is less than a tap number of the second filter.
16. The device according to claim 15, wherein the first filter is a bilinear filter, and wherein the second filter is a 6-tap filter or an adaptive filter.
17. The device according to claim 14, wherein the determining unit is configured to obtain a candidate motion vector set of the current block as the searching scope.
18. The device according to claim 17, wherein the motion searching unit comprises: an information obtaining unit configured to obtain a current template of the current block; a template obtaining unit configured to, for each candidate motion vector, obtain a reference template corresponding to the current template from the up-sampled first reference blocks; a calculating unit configured to calculate respectively matching differences between the current template and each reference template; and a determining unit configured to select at least one motion vector corresponding to at least one reference template with minimum matching difference as the candidate motion vector of the current block.
19. The device according to claim 17, wherein the motion searching unit comprises: a reference block obtaining unit configured to, for each candidate motion vector, obtain two up-sampled first reference blocks; a calculating unit configured to calculate respectively matching difference of each group of the up-sampled first reference blocks; and a determining unit configured to select a motion vector corresponding to the up-sampled first reference blocks with minimum matching difference as the candidate motion vector of the current block.
20. The device according to claim 14, wherein the determining unit comprises: a search starting point determining unit configured to obtain a search starting point of a current template; and an obtaining unit configured to obtain the searching scope according to the search starting point and a predefined searching region.